Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 27323 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 2 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 2.7 MiB |
| Average record size in memory | 104.0 B |
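The overview figures above can be approximated directly with pandas. The sketch below is an illustration under stated assumptions, not pandas-profiling's own code: the function name `overview` and the toy frame are hypothetical, duplicates are counted as repeats after the first occurrence, and memory is measured shallowly without the index.

```python
import pandas as pd


def overview(df: pd.DataFrame) -> dict:
    """Summarise a frame along the lines of the 'Dataset statistics' table."""
    n_rows, n_cols = df.shape
    n_cells = n_rows * n_cols
    missing = int(df.isna().sum().sum())
    # duplicated() flags every repeat of a row after its first occurrence
    duplicate_rows = int(df.duplicated().sum())
    # Shallow size: 8 bytes per float64 value or object pointer, index excluded
    memory_bytes = int(df.memory_usage(deep=False, index=False).sum())
    return {
        "variables": n_cols,
        "observations": n_rows,
        "missing_cells": missing,
        "missing_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicate_rows,
        "memory_bytes": memory_bytes,
        "bytes_per_record": memory_bytes / n_rows if n_rows else 0.0,
    }


# Hypothetical toy data: the last two rows are identical
toy = pd.DataFrame({"a": [1.0, 2.0, 2.0], "b": [3.0, 4.0, 4.0]})
stats = overview(toy)
```

The reported sizes are consistent with this kind of shallow accounting: 13 columns at 8 bytes each give 104 B per record, and 27323 × 104 B = 2,841,592 B, or about 2.7 MiB.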
Variable types
| NUM | 10 |
|---|---|
| CAT | 3 |
Reproduction
| Analysis started | 2020-06-05 07:45:55.800871 |
|---|---|
| Analysis finished | 2020-06-05 07:47:13.350306 |
| Duration | 1 minute and 17.55 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
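As a quick consistency check on the timings above, the duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2020-06-05 07:45:55.800871", FMT)
finished = datetime.strptime("2020-06-05 07:47:13.350306", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
```

The difference is 77.549435 seconds, which rounds to the reported 1 minute and 17.55 seconds.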
Warnings
| Warning | Type |
|---|---|
| Dataset has 2 (< 0.1%) duplicate rows | Duplicates |
| Date has a high cardinality: 253 distinct values | High cardinality |
| region has a high cardinality: 78 distinct values | High cardinality |
| 4046 is highly correlated with Total Volume and 3 other fields | High correlation |
| Total Volume is highly correlated with 4046 and 3 other fields | High correlation |
| 4225 is highly correlated with Total Volume and 2 other fields | High correlation |
| Total Bags is highly correlated with Total Volume and 3 other fields | High correlation |
| Small Bags is highly correlated with Total Volume and 3 other fields | High correlation |
| Large Bags is highly correlated with Total Bags | High correlation |
| Date is uniformly distributed | Uniform |
| 4046 has 345 (1.3%) zeros | Zeros |
| 4770 has 8496 (31.1%) zeros | Zeros |
| Large Bags has 2952 (10.8%) zeros | Zeros |
| XLarge Bags has 16567 (60.6%) zeros | Zeros |
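The zero counts in the warnings can be reproduced with a few lines of pandas. The sketch below is an assumption about how such a check might look, not the report generator's implementation; `zero_report` and the toy frame are hypothetical.

```python
import pandas as pd


def zero_report(df: pd.DataFrame) -> pd.DataFrame:
    """Count exact zeros per numeric column and express them as a share of rows."""
    numeric = df.select_dtypes(include="number")
    zeros = (numeric == 0).sum()
    out = pd.DataFrame({"zeros": zeros, "zeros_pct": 100 * zeros / len(df)})
    # Keep only the columns that actually contain zeros
    return out[out["zeros"] > 0]


# Hypothetical toy data reusing column names from the report
toy = pd.DataFrame({
    "4770": [0.0, 0.0, 12.5, 3.0],
    "Total Volume": [10.0, 20.0, 30.0, 40.0],
    "region": ["Albany", "Albany", "Boston", "Boston"],
})
report = zero_report(toy)
```

Applied to the figures above, 8496 / 27323 ≈ 31.1% for 4770 and 16567 / 27323 ≈ 60.6% for XLarge Bags.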
Date
Categorical
| Distinct count | 253 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 213.5 KiB |
| Value | Count | Frequency (%) | |
| 2017-03-05 | 109 | 0.4% | |
| 2017-02-26 | 109 | 0.4% | |
| 2019-08-18 | 108 | 0.4% | |
| 2018-01-21 | 108 | 0.4% | |
| 2019-07-28 | 108 | 0.4% | |
| 2019-07-21 | 108 | 0.4% | |
| 2019-01-27 | 108 | 0.4% | |
| 2017-03-19 | 108 | 0.4% | |
| 2015-11-01 | 108 | 0.4% | |
| 2015-01-04 | 108 | 0.4% | |
| Other values (243) | 26241 | 96.0% |
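A frequency table of this shape, with the tail collapsed into an "Other values" row, can be sketched as follows. The function name `frequency_table` and the sample series are hypothetical, and the cut-off of ten rows is taken from the table above rather than from any documented setting.

```python
import pandas as pd


def frequency_table(s: pd.Series, top: int = 10) -> pd.DataFrame:
    """List the `top` most common values and fold the rest into one row."""
    counts = s.value_counts()          # sorted by count, descending
    head, rest = counts.iloc[:top], counts.iloc[top:]
    table = pd.DataFrame({
        "Value": head.index.astype(str),
        "Count": head.to_numpy(),
    })
    if len(rest):
        other = pd.DataFrame({
            "Value": [f"Other values ({len(rest)})"],
            "Count": [int(rest.sum())],
        })
        table = pd.concat([table, other], ignore_index=True)
    table["Frequency (%)"] = 100 * table["Count"] / len(s)
    return table


# Hypothetical sample: two frequent dates and two singletons
dates = pd.Series(
    ["2017-03-05"] * 3 + ["2017-02-26"] * 2 + ["2019-08-18", "2018-01-21"]
)
table = frequency_table(dates, top=2)
```

In the table above, the ten listed dates account for 2 × 109 + 8 × 108 = 1082 rows, which leaves 27323 − 1082 = 26241 rows spread over the remaining 243 distinct dates.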
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
AveragePrice
Real number (ℝ≥0)
| Distinct count | 260 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.400632434212934 |
|---|---|
| Minimum | 0.44 |
| Maximum | 3.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0.44 |
|---|---|
| 5-th percentile | 0.85 |
| Q1 | 1.11 |
| median | 1.37 |
| Q3 | 1.64 |
| 95-th percentile | 2.09 |
| Maximum | 3.25 |
| Range | 2.81 |
| Interquartile range (IQR) | 0.53 |
Descriptive statistics
| Standard deviation | 0.3854387199 |
|---|---|
| Coefficient of variation (CV) | 0.2751890578 |
| Kurtosis | 0.4001088522 |
| Mean | 1.400632434 |
| Median Absolute Deviation (MAD) | 0.26 |
| Skewness | 0.5980915978 |
| Sum | 38269.48 |
| Variance | 0.1485630068 |
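Several of the derived statistics follow directly from the others. The sketch below shows the relationships for range, IQR, coefficient of variation and variance; `spread_stats` and the sample series are hypothetical, and the use of the sample (n − 1) standard deviation is an assumption about the report.

```python
import pandas as pd


def spread_stats(s: pd.Series) -> dict:
    """Derive range, IQR, CV and variance from basic summary values."""
    q1, q3 = s.quantile(0.25), s.quantile(0.75)
    mean = s.mean()
    std = s.std()  # pandas uses the sample (n - 1) denominator by default
    return {
        "range": s.max() - s.min(),
        "iqr": q3 - q1,
        "cv": std / mean,       # coefficient of variation
        "variance": std ** 2,
    }


# Hypothetical sample values
prices = pd.Series([1.0, 2.0, 3.0, 4.0, 5.0])
stats = spread_stats(prices)
```

For AveragePrice the reported values agree with these definitions: 3.25 − 0.44 = 2.81 (range), 1.64 − 1.11 = 0.53 (IQR), and 0.3854387199 / 1.400632434 ≈ 0.2752 (CV).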
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1.15 | 310 | 1.1% | |
| 1.19 | 309 | 1.1% | |
| 1.16 | 303 | 1.1% | |
| 1.14 | 298 | 1.1% | |
| 1.26 | 294 | 1.1% | |
| 1.25 | 290 | 1.1% | |
| 1.18 | 284 | 1.0% | |
| 1.23 | 283 | 1.0% | |
| 1.44 | 282 | 1.0% | |
| 1.13 | 282 | 1.0% | |
| Other values (250) | 24388 | 89.3% |
| Value | Count | Frequency (%) | |
| 0.44 | 1 | < 0.1% | |
| 0.46 | 1 | < 0.1% | |
| 0.48 | 1 | < 0.1% | |
| 0.49 | 2 | < 0.1% | |
| 0.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3.25 | 1 | < 0.1% | |
| 3.17 | 1 | < 0.1% | |
| 3.12 | 1 | < 0.1% | |
| 3.05 | 1 | < 0.1% | |
| 3.04 | 1 | < 0.1% |
Total Volume
Real number (ℝ≥0)
| Distinct count | 27296 |
|---|---|
| Unique (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 913546.9352911466 |
|---|---|
| Minimum | 84.56 |
| Maximum | 63716144.15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 84.56 |
|---|---|
| 5-th percentile | 2928.631 |
| Q1 | 13614.12 |
| median | 119865.41 |
| Q3 | 474720.52 |
| 95-th percentile | 4082635.401 |
| Maximum | 63716144.15 |
| Range | 63716059.59 |
| Interquartile range (IQR) | 461106.4 |
Descriptive statistics
| Standard deviation | 3702672.272 |
|---|---|
| Coefficient of variation (CV) | 4.05307284 |
| Kurtosis | 93.12071134 |
| Mean | 913546.9353 |
| Median Absolute Deviation (MAD) | 114173.16 |
| Skewness | 9.067931485 |
| Sum | 2.496084291e+10 |
| Variance | 1.370978195e+13 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 9465.99 | 2 | < 0.1% | |
| 3529.44 | 2 | < 0.1% | |
| 19634.24 | 2 | < 0.1% | |
| 46602.16 | 2 | < 0.1% | |
| 6022.12 | 2 | < 0.1% | |
| 16507 | 2 | < 0.1% | |
| 12809.78 | 2 | < 0.1% | |
| 7223.46 | 2 | < 0.1% | |
| 256548.81 | 2 | < 0.1% | |
| 3713.49 | 2 | < 0.1% | |
| Other values (27286) | 27303 | 99.9% |
| Value | Count | Frequency (%) | |
| 84.56 | 1 | < 0.1% | |
| 253.45 | 1 | < 0.1% | |
| 331.19 | 1 | < 0.1% | |
| 336.95 | 1 | < 0.1% | |
| 338.22 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 63716144.15 | 1 | < 0.1% | |
| 62505646.52 | 1 | < 0.1% | |
| 62451514.93 | 1 | < 0.1% | |
| 61034457.1 | 1 | < 0.1% | |
| 52288697.89 | 1 | < 0.1% |
4046
Real number (ℝ≥0)
| Distinct count | 26361 |
|---|---|
| Unique (%) | 96.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 295318.73312191194 |
|---|---|
| Minimum | 0.0 |
| Maximum | 22743616.17 |
| Zeros | 345 |
| Zeros (%) | 1.3% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 25.605 |
| Q1 | 796.425 |
| median | 10037.85 |
| Q3 | 113317.895 |
| 95-th percentile | 1244389.435 |
| Maximum | 22743616.17 |
| Range | 22743616.17 |
| Interquartile range (IQR) | 112521.47 |
Descriptive statistics
| Standard deviation | 1273010.158 |
|---|---|
| Coefficient of variation (CV) | 4.310631244 |
| Kurtosis | 87.25838413 |
| Mean | 295318.7331 |
| Median Absolute Deviation (MAD) | 10010.49 |
| Skewness | 8.694985241 |
| Sum | 8068993745 |
| Variance | 1.620554862e+12 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 345 | 1.3% | |
| 3 | 19 | 0.1% | |
| 4 | 13 | < 0.1% | |
| 1 | 13 | < 0.1% | |
| 1.24 | 9 | < 0.1% | |
| 6 | 8 | < 0.1% | |
| 1.25 | 7 | < 0.1% | |
| 1.3 | 6 | < 0.1% | |
| 1.21 | 6 | < 0.1% | |
| 1.27 | 6 | < 0.1% | |
| Other values (26351) | 26891 | 98.4% |
| Value | Count | Frequency (%) | |
| 0 | 345 | 1.3% | |
| 1 | 13 | < 0.1% | |
| 1.13 | 1 | < 0.1% | |
| 1.19 | 3 | < 0.1% | |
| 1.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 22743616.17 | 1 | < 0.1% | |
| 21620180.9 | 1 | < 0.1% | |
| 21137400.46 | 1 | < 0.1% | |
| 19498919.53 | 1 | < 0.1% | |
| 18933038.04 | 1 | < 0.1% |
4225
Real number (ℝ≥0)
| Distinct count | 26947 |
|---|---|
| Unique (%) | 98.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 290105.93422427995 |
|---|---|
| Minimum | 0.0 |
| Maximum | 20470572.61 |
| Zeros | 172 |
| Zeros (%) | 0.6% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 107.825 |
| Q1 | 2922.98 |
| median | 25688.49 |
| Q3 | 145446.395 |
| 95-th percentile | 1225194.379 |
| Maximum | 20470572.61 |
| Range | 20470572.61 |
| Interquartile range (IQR) | 142523.415 |
Descriptive statistics
| Standard deviation | 1187227.331 |
|---|---|
| Coefficient of variation (CV) | 4.092392437 |
| Kurtosis | 90.56766958 |
| Mean | 290105.9342 |
| Median Absolute Deviation (MAD) | 25203.12 |
| Skewness | 8.868402759 |
| Sum | 7926564441 |
| Variance | 1.409508736e+12 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 172 | 0.6% | |
| 1 | 17 | 0.1% | |
| 2 | 6 | < 0.1% | |
| 1.26 | 4 | < 0.1% | |
| 2.75 | 3 | < 0.1% | |
| 177.87 | 3 | < 0.1% | |
| 5.85 | 3 | < 0.1% | |
| 10.91 | 3 | < 0.1% | |
| 215.36 | 3 | < 0.1% | |
| 5.44 | 3 | < 0.1% | |
| Other values (26937) | 27106 | 99.2% |
| Value | Count | Frequency (%) | |
| 0 | 172 | 0.6% | |
| 1 | 17 | 0.1% | |
| 1.26 | 4 | < 0.1% | |
| 1.28 | 2 | < 0.1% | |
| 1.3 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20470572.61 | 1 | < 0.1% | |
| 20445501.03 | 1 | < 0.1% | |
| 20328161.55 | 1 | < 0.1% | |
| 19900871.87 | 1 | < 0.1% | |
| 18956479.74 | 1 | < 0.1% |
4770
Real number (ℝ≥0)
| Distinct count | 17512 |
|---|---|
| Unique (%) | 64.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22158.67818907148 |
|---|---|
| Minimum | 0.0 |
| Maximum | 2546439.11 |
| Zeros | 8496 |
| Zeros (%) | 31.1% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 192.69 |
| Q3 | 5898.3 |
| 95-th percentile | 103556.366 |
| Maximum | 2546439.11 |
| Range | 2546439.11 |
| Interquartile range (IQR) | 5898.3 |
Descriptive statistics
| Standard deviation | 103132.8556 |
|---|---|
| Coefficient of variation (CV) | 4.65428735 |
| Kurtosis | 123.5429418 |
| Mean | 22158.67819 |
| Median Absolute Deviation (MAD) | 192.69 |
| Skewness | 9.77606234 |
| Sum | 605441564.2 |
| Variance | 1.06363859e+10 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 8496 | 31.1% | |
| 2.66 | 11 | < 0.1% | |
| 4 | 9 | < 0.1% | |
| 1.57 | 8 | < 0.1% | |
| 9 | 8 | < 0.1% | |
| 1.65 | 7 | < 0.1% | |
| 3 | 7 | < 0.1% | |
| 1.6 | 7 | < 0.1% | |
| 2 | 7 | < 0.1% | |
| 3.32 | 7 | < 0.1% | |
| Other values (17502) | 18756 | 68.6% |
| Value | Count | Frequency (%) | |
| 0 | 8496 | 31.1% | |
| 0.83 | 1 | < 0.1% | |
| 1 | 5 | < 0.1% | |
| 1.01 | 1 | < 0.1% | |
| 1.09 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2546439.11 | 1 | < 0.1% | |
| 1993645.36 | 1 | < 0.1% | |
| 1896149.5 | 1 | < 0.1% | |
| 1880231.38 | 1 | < 0.1% | |
| 1811090.71 | 1 | < 0.1% |
Total Bags
Real number (ℝ≥0)
| Distinct count | 27148 |
|---|---|
| Unique (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 305873.958118435 |
|---|---|
| Minimum | 0.0 |
| Maximum | 23472988.69 |
| Zeros | 15 |
| Zeros (%) | 0.1% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 916.947 |
| Q1 | 7703.605 |
| median | 47750.39 |
| Q3 | 146102.06 |
| 95-th percentile | 1289527.221 |
| Maximum | 23472988.69 |
| Range | 23472988.69 |
| Interquartile range (IQR) | 138398.455 |
Descriptive statistics
| Standard deviation | 1274850.96 |
|---|---|
| Coefficient of variation (CV) | 4.16789637 |
| Kurtosis | 119.753632 |
| Mean | 305873.9581 |
| Median Absolute Deviation (MAD) | 44335.67 |
| Skewness | 10.08876168 |
| Sum | 8357394158 |
| Variance | 1.62524497e+12 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 15 | 0.1% | |
| 300 | 5 | < 0.1% | |
| 990 | 5 | < 0.1% | |
| 550 | 4 | < 0.1% | |
| 916.67 | 4 | < 0.1% | |
| 266.67 | 4 | < 0.1% | |
| 453.33 | 4 | < 0.1% | |
| 436.67 | 3 | < 0.1% | |
| 613.33 | 3 | < 0.1% | |
| 803.33 | 3 | < 0.1% | |
| Other values (27138) | 27273 | 99.8% |
| Value | Count | Frequency (%) | |
| 0 | 15 | 0.1% | |
| 3.09 | 1 | < 0.1% | |
| 3.11 | 1 | < 0.1% | |
| 3.19 | 1 | < 0.1% | |
| 3.33 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 23472988.69 | 1 | < 0.1% | |
| 21625372.67 | 1 | < 0.1% | |
| 20597427.49 | 1 | < 0.1% | |
| 20597401.03 | 1 | < 0.1% | |
| 19733973.9 | 1 | < 0.1% |
Small Bags
Real number (ℝ≥0)
| Distinct count | 26348 |
|---|---|
| Unique (%) | 96.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 218698.1972565238 |
|---|---|
| Minimum | 0.0 |
| Maximum | 15436246.72 |
| Zeros | 159 |
| Zeros (%) | 0.6% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 406.67 |
| Q1 | 5283.05 |
| median | 32231.5 |
| Q3 | 104842.35 |
| 95-th percentile | 1007995.497 |
| Maximum | 15436246.72 |
| Range | 15436246.72 |
| Interquartile range (IQR) | 99559.3 |
Descriptive statistics
| Standard deviation | 888129.2037 |
|---|---|
| Coefficient of variation (CV) | 4.060980908 |
| Kurtosis | 106.4973823 |
| Mean | 218698.1973 |
| Median Absolute Deviation (MAD) | 30926.24 |
| Skewness | 9.576043389 |
| Sum | 5975490844 |
| Variance | 7.887734825e+11 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 159 | 0.6% | |
| 203.33 | 11 | < 0.1% | |
| 223.33 | 10 | < 0.1% | |
| 533.33 | 10 | < 0.1% | |
| 196.67 | 8 | < 0.1% | |
| 103.33 | 8 | < 0.1% | |
| 263.33 | 8 | < 0.1% | |
| 326.67 | 8 | < 0.1% | |
| 300 | 8 | < 0.1% | |
| 216.67 | 8 | < 0.1% | |
| Other values (26338) | 27085 | 99.1% |
| Value | Count | Frequency (%) | |
| 0 | 159 | 0.6% | |
| 2.52 | 1 | < 0.1% | |
| 2.57 | 1 | < 0.1% | |
| 2.73 | 1 | < 0.1% | |
| 2.79 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 15436246.72 | 1 | < 0.1% | |
| 15264523.33 | 1 | < 0.1% | |
| 13384586.8 | 1 | < 0.1% | |
| 13377154.27 | 1 | < 0.1% | |
| 13110016.21 | 1 | < 0.1% |
Large Bags
Real number (ℝ≥0)
| Distinct count | 23042 |
|---|---|
| Unique (%) | 84.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82025.37220766387 |
|---|---|
| Minimum | 0.0 |
| Maximum | 8378355.78 |
| Zeros | 2952 |
| Zeros (%) | 10.8% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 277.37 |
| median | 4312.49 |
| Q3 | 32684.94 |
| 95-th percentile | 301699.133 |
| Maximum | 8378355.78 |
| Range | 8378355.78 |
| Interquartile range (IQR) | 32407.57 |
Descriptive statistics
| Standard deviation | 391735.6238 |
|---|---|
| Coefficient of variation (CV) | 4.775785994 |
| Kurtosis | 160.8942619 |
| Mean | 82025.37221 |
| Median Absolute Deviation (MAD) | 4312.49 |
| Skewness | 11.30554045 |
| Sum | 2241179245 |
| Variance | 1.534567989e+11 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 2952 | 10.8% | |
| 3.33 | 256 | 0.9% | |
| 6.67 | 125 | 0.5% | |
| 10 | 78 | 0.3% | |
| 13.33 | 49 | 0.2% | |
| 4.44 | 40 | 0.1% | |
| 16.67 | 32 | 0.1% | |
| 6.66 | 28 | 0.1% | |
| 20 | 25 | 0.1% | |
| 26.67 | 24 | 0.1% | |
| Other values (23032) | 23714 | 86.8% |
| Value | Count | Frequency (%) | |
| 0 | 2952 | 10.8% | |
| 0.97 | 1 | < 0.1% | |
| 1.3 | 1 | < 0.1% | |
| 1.33 | 1 | < 0.1% | |
| 1.35 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 8378355.78 | 1 | < 0.1% | |
| 7958753.83 | 1 | < 0.1% | |
| 7864297.23 | 1 | < 0.1% | |
| 7806415.69 | 1 | < 0.1% | |
| 7790540.1 | 1 | < 0.1% |
XLarge Bags
Real number (ℝ≥0)
| Distinct count | 9115 |
|---|---|
| Unique (%) | 33.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5150.387570910954 |
|---|---|
| Minimum | 0.0 |
| Maximum | 844929.83 |
| Zeros | 16567 |
| Zeros (%) | 60.6% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 450.665 |
| 95-th percentile | 17612.705 |
| Maximum | 844929.83 |
| Range | 844929.83 |
| Interquartile range (IQR) | 450.665 |
Descriptive statistics
| Standard deviation | 30719.20777 |
|---|---|
| Coefficient of variation (CV) | 5.964445849 |
| Kurtosis | 226.5085073 |
| Mean | 5150.387571 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.1689564 |
| Sum | 140724039.6 |
| Variance | 943669725.8 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 16567 | 60.6% | |
| 3.33 | 152 | 0.6% | |
| 6.67 | 100 | 0.4% | |
| 10 | 57 | 0.2% | |
| 13.33 | 42 | 0.2% | |
| 1.11 | 29 | 0.1% | |
| 20 | 29 | 0.1% | |
| 16.67 | 18 | 0.1% | |
| 2.22 | 18 | 0.1% | |
| 5 | 15 | 0.1% | |
| Other values (9105) | 10296 | 37.7% |
| Value | Count | Frequency (%) | |
| 0 | 16567 | 60.6% | |
| 1 | 2 | < 0.1% | |
| 1.11 | 29 | 0.1% | |
| 1.26 | 1 | < 0.1% | |
| 1.3 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 844929.83 | 1 | < 0.1% | |
| 751144.1 | 1 | < 0.1% | |
| 745488.94 | 1 | < 0.1% | |
| 717175.84 | 1 | < 0.1% | |
| 716104.2 | 1 | < 0.1% |
type
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 213.5 KiB |
| Value | Count | Frequency (%) | |
| conventional | 13662 | 50.0% | |
| organic | 13661 | 50.0% |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 9.500091498 |
| Min length | 7 |
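The mean length reported above is the count-weighted average of the two label lengths. A minimal check, using the counts from the value table and illustrative variable names:

```python
# Counts copied from the value table for `type`
counts = {"conventional": 13662, "organic": 13661}

total_rows = sum(counts.values())
# Each label contributes its character length once per row
total_chars = sum(len(value) * n for value, n in counts.items())
mean_length = total_chars / total_rows
```

This gives (12 × 13662 + 7 × 13661) / 27323 ≈ 9.500091498. The median length is 12 because "conventional" holds the one extra row.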
year
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.956593346265 |
|---|---|
| Minimum | 2015 |
| Maximum | 2019 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 213.5 KiB |
Quantile statistics
| Minimum | 2015 |
|---|---|
| 5-th percentile | 2015 |
| Q1 | 2016 |
| median | 2017 |
| Q3 | 2018 |
| 95-th percentile | 2019 |
| Maximum | 2019 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.406538837 |
|---|---|
| Coefficient of variation (CV) | 0.000697357019 |
| Kurtosis | -1.282566679 |
| Mean | 2016.956593 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.04300041133 |
| Sum | 55109305 |
| Variance | 1.978351501 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 2017 | 5616 | 20.6% | |
| 2016 | 5616 | 20.6% | |
| 2015 | 5615 | 20.6% | |
| 2018 | 5292 | 19.4% | |
| 2019 | 5184 | 19.0% |
| Value | Count | Frequency (%) | |
| 2015 | 5615 | 20.6% | |
| 2016 | 5616 | 20.6% | |
| 2017 | 5616 | 20.6% | |
| 2018 | 5292 | 19.4% | |
| 2019 | 5184 | 19.0% |
| Value | Count | Frequency (%) | |
| 2019 | 5184 | 19.0% | |
| 2018 | 5292 | 19.4% | |
| 2017 | 5616 | 20.6% | |
| 2016 | 5616 | 20.6% | |
| 2015 | 5615 | 20.6% |
region
Categorical
| Distinct count | 78 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 213.5 KiB |
| Value | Count | Frequency (%) | |
| Chicago | 506 | 1.9% | |
| Boston | 506 | 1.9% | |
| Spokane | 506 | 1.9% | |
| Denver | 506 | 1.9% | |
| Seattle | 506 | 1.9% | |
| Nashville | 506 | 1.9% | |
| Houston | 506 | 1.9% | |
| Orlando | 506 | 1.9% | |
| Plains | 506 | 1.9% | |
| Albany | 506 | 1.9% | |
| Other values (68) | 22263 | 81.5% |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 10.81103832 |
| Min length | 4 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only its direction determines the sign. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
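The definition of r can be written out in a few lines. This is a minimal pure-Python sketch for illustration; `pearson_r` is a hypothetical helper, not the routine used to build the report.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # The 1/(n - 1) factors of covariance and standard deviations cancel out
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


x = [1.0, 2.0, 3.0, 4.0]
r_up = pearson_r(x, [2 * v + 5 for v in x])    # rescaled and shifted copy of x
r_down = pearson_r(x, [-0.5 * v for v in x])   # decreasing linear function of x
```

Both examples are exactly linear, so r has magnitude 1 regardless of slope or offset; only the sign differs.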
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
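Because ρ is Pearson's r applied to ranks, it can be sketched by ranking both variables first. The helpers below are hypothetical and written for illustration; ties receive the average of the positions they occupy, which is a common convention and an assumption here.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]        # monotonic but nonlinear in x
rho = spearman_rho(x, y)
r = pearson_r(x, y)
```

For this cubic relationship ρ is 1 because the ordering is preserved exactly, while Pearson's r stays below 1 because the relationship is not linear.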
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
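The pair-counting definition translates directly into code. The sketch below implements the formula exactly as stated in the text, which corresponds to the variant usually called τ-a; `kendall_tau_a` is a hypothetical helper, and tied pairs are counted as neither concordant nor discordant.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1      # both variables move in the same direction
        elif sign < 0:
            discordant += 1      # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


x = [1, 2, 3, 4]
tau = kendall_tau_a(x, [1, 3, 2, 4])          # one swapped pair out of six
tau_reversed = kendall_tau_a(x, [4, 3, 2, 1])
```

With one discordant pair out of six, τ is (5 − 1) / 6. Libraries often default to a tie-adjusted variant instead (SciPy's `kendalltau` uses τ-b by default), so results can differ when the data contain ties.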
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Date | AveragePrice | Total Volume | 4046 | 4225 | 4770 | Total Bags | Small Bags | Large Bags | XLarge Bags | type | year | region |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015-01-04 | 1.22 | 40873.28 | 2819.50 | 28287.42 | 49.90 | 9716.46 | 9186.93 | 529.53 | 0.0 | conventional | 2015 | Albany |
| 1 | 2015-01-11 | 1.24 | 41195.08 | 1002.85 | 31640.34 | 127.12 | 8424.77 | 8036.04 | 388.73 | 0.0 | conventional | 2015 | Albany |
| 2 | 2015-01-18 | 1.17 | 44511.28 | 914.14 | 31540.32 | 135.77 | 11921.05 | 11651.09 | 269.96 | 0.0 | conventional | 2015 | Albany |
| 3 | 2015-01-25 | 1.06 | 45147.50 | 941.38 | 33196.16 | 164.14 | 10845.82 | 10103.35 | 742.47 | 0.0 | conventional | 2015 | Albany |
| 4 | 2015-02-01 | 0.99 | 70873.60 | 1353.90 | 60017.20 | 179.32 | 9323.18 | 9170.82 | 152.36 | 0.0 | conventional | 2015 | Albany |
| 5 | 2015-02-08 | 0.99 | 51253.97 | 1357.37 | 39111.81 | 163.25 | 10621.54 | 10113.10 | 508.44 | 0.0 | conventional | 2015 | Albany |
| 6 | 2015-02-15 | 1.06 | 41567.62 | 986.66 | 30045.51 | 222.42 | 10313.03 | 9979.87 | 333.16 | 0.0 | conventional | 2015 | Albany |
| 7 | 2015-02-22 | 1.07 | 45675.05 | 1088.38 | 35056.13 | 151.00 | 9379.54 | 9000.16 | 379.38 | 0.0 | conventional | 2015 | Albany |
| 8 | 2015-03-01 | 0.99 | 55595.74 | 629.46 | 45633.34 | 181.49 | 9151.45 | 8986.06 | 165.39 | 0.0 | conventional | 2015 | Albany |
| 9 | 2015-03-08 | 1.07 | 40507.36 | 795.68 | 30370.64 | 159.05 | 9181.99 | 8827.55 | 354.44 | 0.0 | conventional | 2015 | Albany |
Last rows
| | Date | AveragePrice | Total Volume | 4046 | 4225 | 4770 | Total Bags | Small Bags | Large Bags | XLarge Bags | type | year | region |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 27313 | 2015-10-18 | 2.02 | 7664.36 | 1523.54 | 3491.30 | 0.00 | 2649.52 | 2606.66 | 42.86 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27314 | 2015-10-25 | 2.00 | 6447.44 | 1235.04 | 2895.73 | 0.00 | 2316.67 | 2316.67 | 0.00 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27315 | 2015-11-01 | 1.92 | 7296.25 | 1652.42 | 3123.83 | 0.00 | 2520.00 | 2520.00 | 0.00 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27316 | 2015-11-08 | 1.98 | 7603.07 | 2198.14 | 3139.24 | 26.37 | 2239.32 | 2223.34 | 15.98 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27317 | 2015-11-15 | 1.92 | 8175.94 | 1925.21 | 3271.43 | 16.72 | 2962.58 | 2946.66 | 15.92 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27318 | 2015-11-22 | 1.97 | 6249.43 | 1733.40 | 2873.92 | 30.95 | 1611.16 | 1590.00 | 21.16 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27319 | 2015-11-29 | 2.08 | 4638.10 | 1395.02 | 2238.04 | 61.71 | 943.33 | 943.33 | 0.00 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27320 | 2015-12-13 | 1.80 | 7836.65 | 2194.49 | 2981.01 | 25.97 | 2635.18 | 2598.45 | 36.73 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27321 | 2015-12-20 | 1.92 | 6255.19 | 1512.45 | 2407.32 | 11.78 | 2323.64 | 2213.72 | 109.92 | 0.0 | organic | 2015 | WestTexNewMexico |
| 27322 | 2015-12-27 | 1.81 | 7155.63 | 1478.79 | 2629.64 | 14.10 | 3033.10 | 2855.55 | 177.55 | 0.0 | organic | 2015 | WestTexNewMexico |
Most frequent duplicate rows
| | Date | AveragePrice | Total Volume | 4046 | 4225 | 4770 | Total Bags | Small Bags | Large Bags | XLarge Bags | type | year | region | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2017-02-26 | 1.16 | 39054.83 | 3021.26 | 15568.68 | 11.77 | 20453.12 | 20299.52 | 153.60 | 0.0 | organic | 2017 | West Tex/New Mexico | 2 |
| 1 | 2017-03-05 | 1.23 | 24969.30 | 2292.42 | 4876.69 | 52.82 | 17747.37 | 17114.89 | 632.48 | 0.0 | organic | 2017 | West Tex/New Mexico | 2 |
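A table of repeated rows with their counts, like the one above, can be built with pandas. This is a sketch under assumptions: `most_frequent_duplicates` and the toy frame are hypothetical, and rows containing missing values would be dropped by `groupby` (the profiled dataset reports none).

```python
import pandas as pd


def most_frequent_duplicates(df: pd.DataFrame) -> pd.DataFrame:
    """Return each fully repeated row once, with the number of times it occurs."""
    # keep=False marks every member of a duplicate group, including the first
    dupes = df[df.duplicated(keep=False)]
    return (
        dupes.groupby(list(df.columns), sort=False)
        .size()
        .reset_index(name="count")
        .sort_values("count", ascending=False, ignore_index=True)
    )


# Hypothetical toy data: the first two rows are identical
toy = pd.DataFrame({
    "Date": ["2017-02-26", "2017-02-26", "2017-03-05", "2017-03-12"],
    "AveragePrice": [1.16, 1.16, 1.23, 1.30],
    "region": ["A", "A", "A", "B"],
})
result = most_frequent_duplicates(toy)
```

On the toy frame this yields a single row with a count of 2, matching the structure of the table above where each repeated record appears twice.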